Defective-interfering (DI) RNAs of RNA viruses, in general, are generated and continue to evolve in size during serial
undiluted passages of viruses. This evolution was thought to occur by independent generation of DI RNAs during virus
passages and subsequent selection of new DI RNAs under new cellular conditions. Here we demonstrate that
recombination between the old DI RNA and the helper viral RNA can be one of the mechanisms for natural DI RNA evolution. A
mouse hepatitis virus (MHV) DI RNA, DIssE RNA, was transcribed in vitro and transfected into a mouse cell line
infected with a different MHV strain (A59), which is distinguishable from the original natural helper MHV (JHM). During
subsequent serial undiluted passages of the harvested virus, several novel DI RNA species were generated, while the
original DIssE RNA disappeared by passage 11. cDNA cloning and sequence analysis of one of these novel DI RNAs,
designated DI-2, revealed that it is composed of four discontinuous regions of the genomic sequence and is different
from the structure of the original DIssE RNA. Sequence comparison among DI-2, DIssE, and helper MHV-A59 RNAs
showed that the DI-2 sequence is similar to DIssE in the first and second regions, but similar to the helper A59 virus in the
third and last regions. Thus, this DI RNA was generated by RNA recombination between the original DIssE RNA and the
helper viral RNA. These results indicate that recombination between DI RNA and helper virus RNA can be involved in
the natural evolution of DI RNAs.


Most RNA viruses generate defective-interfering (DI) RNAs upon
serial passages at high multiplicity of infection. These DI RNAs
usually consist of various noncontiguous regions of the wild-type
viral RNA genome and are thought to be generated by RNA polymerase
jumping during RNA transcription (1, 2). DI RNAs generally replicate
more efficiently than the nondefective viral RNA and suppress the
replication of the latter. However, upon continuous passage of virus
and cells, the relative ratio of the DI RNA and nondefective viral
RNA fluctuates, and new DI RNA species sometimes appear, replacing
the original DI RNA. This type of evolution of DI RNA has been noted
in vesicular stomatitis virus (3, 4) and mouse hepatitis virus (MHV)
(5). The evolution of DI RNA species was thought to be the result of
selection of a new DI RNA during serial passages, which altered the
virus-cell equilibrium (6) and favored the new type of DI RNA.
However, the mechanism by which the new DI RNA was generated was not
clear. These new DI RNAs could be generated de novo from the helper
viral RNA, independently of the existing DI RNA, or could be the
result of recombination or rearrangement of sequences between the
existing DI RNAs and helper virus RNA. These two possibilities could
not be distinguished because the primary nucleotide sequences of the
DI RNA and helper virus RNA are identical, making the origin of new
DI RNAs impossible to determine. In this report, we addressed this
issue by transfecting an in vitro synthesized DI RNA of MHV into
cells preinfected with a heterologous MHV, which is different from
the original helper virus; thus, the sequences of the transfected DI
RNA and the helper viral RNA were distinguishable. The evolution of
DI RNA was followed during serial virus passages and was found to
result from recombination between the transfected DI RNA and helper
virus RNA. Thus, we conclude that RNA recombination plays a major
role in the natural evolution of DI RNAs.

MHV, a member of Coronaviridae, is an enveloped virus containing a
31-kb, single-stranded, positive-sense genomic RNA (7, 8). MHV
synthesizes seven to eight mRNAs that have a 3'-coterminal,
nested-set structure (9, 10) and an identical 5'-end leader sequence
of 72 to 77 nucleotides, which is derived from the 5' end of the
genome (11-13). The MHV mRNAs probably are synthesized by a
discontinuous transcription mechanism involving polymerase jumping
(14). Unlike other single-stranded RNA viruses, MHV undergoes RNA
recombination at a very high frequency, which is almost as high as
that of viruses with segmented RNA genomes (15, 16). This phenomenon
suggests that discontinuous RNA synthesis and occasional copy-choice
switching of RNA templates may also be involved in the replication
of coronavirus RNA. Probably as a result of discontinuous RNA
synthesis, DI RNA is readily generated from MHV during virus
passages at high multiplicity of infection (17). Several different
kinds of DI RNA have been identified, and the dominant DI RNA
species varies with different passage levels (5). Some of these DI
RNAs have been molecularly cloned and sequenced (18-20). All of them
are composed of several noncontiguous genomic regions and contain
the 5' and 3' ends of the parental MHV genome (18-20).

Two particular DI RNA species have been extensively studied. DIssE
RNA was generated from the JHM strain of MHV after 16 serial
undiluted passages (5) in DBT cells, an astrocytoma cell line (21).
This DI RNA is 2.2 kb long and is composed of three discontinuous
regions of JHM genomic RNA (18). The other, DIssF RNA, was also
generated from JHM after five additional undiluted passages of
DIssE-containing virus, when DIssE RNA became undetectable (5). This
RNA is 3.6 kb long and contains five discontiguous regions, some of
which, including the 5' and 3' ends and part of the second domain,
are identical to DIssE (19). The sequence relationship between DIssE
and DIssF RNAs, particularly the fact that the two RNAs have the
same junction site between the first and second domains, suggests
that DIssF may have been directly derived from DIssE rather than
generated independently. However, this possibility could not be
examined further, since both RNA species were generated from cells
infected with the same helper virus. Nevertheless, these RNAs offer
an experimental system to examine the mechanism of evolution of DI
RNAs in MHV-infected cells.

DIssE RNA was transcribed from a pT7 vector, DE25 (22), by T7 RNA
polymerase and transfected into DBT cells which had been infected
with A59, an MHV strain different from the original helper virus,
the JHM strain. It has previously been demonstrated that the in
vitro transcribed DE25 RNA can replicate in the presence of a helper
MHV and, despite the absence of a specific packaging signal, can be
passaged with high efficiency (22). The viruses were harvested from
the DE25-transfected and A59-infected cells and passaged serially in
DBT cells without dilution of virus samples. A total of 11 passages
were carried out. 32P-labeled intracellular viral RNA patterns at
selected passage levels were then examined by agarose gel
electrophoresis after denaturation of the RNA with glyoxal. Figure 1
shows that at passage 3, an RNA with a size equivalent to that of
DIssE was the major RNA species, while the synthesis of most of the
MHV-A59-specific mRNAs was inhibited. At passage 7, very little
DIssE RNA and helper virus mRNAs could be detected. Only a small
amount of genomic RNA and a new RNA migrating between mRNAs 6 and 7
could be detected. Interestingly, at passage 11, large amounts of
viral RNA synthesis resumed, and several new RNA species of
different sizes were detected; three of them migrated between
MHV-A59 mRNAs 3 and 4, and another new RNA species (DI-2) was
slightly smaller than MHV-A59 mRNA 5.

Fig. 1. Agarose gel electrophoresis of glyoxal-denatured
intracellular RNAs of DIssE-transfected cells at different passage
levels. Plasmid DE25 (22), which contains the T7 promoter sequence
preceding the complete DIssE cDNA, was transcribed as described
previously (22). The DIssE RNA was transfected into MHV A59-infected
L2 cells by DEAE-dextran as previously described (34). The virus was
harvested from the media and serial passages were carried out on DBT
cells using undiluted virus. 32P-labeled intracellular viral RNA was
extracted as described previously (35). The arrowhead and arrow
represent DIssE and DI-2 RNAs, respectively. The number over each
lane denotes the passage level, and in the lane of passage level 0,
only the helper virus, MHV A59, was propagated in the cells. The
intracellular RNA species of the standard MHV A59 are denoted on the
side of the gel.

To study the origin of these new RNA species, we cloned several of
them. We first focused on DI-2 (arrow, Fig. 1), since it is the
smallest and most prominent new RNA and is apparently different in
size from DIssE. The DI-2 RNA from the intracellular RNA at passage
11 was purified from the gel as previously described (22) and used
for oligo(dT)-primed cDNA synthesis. The double-stranded DNAs were
cloned into EcoRI-linker-ligated pBluescript according to the
published procedures (19). cDNA clones were screened by colony
hybridization using DIssE RNA as a probe (19). Several overlapping
cDNA clones were isolated and their structures were analyzed by
sequencing. Any sequence gaps and ambiguous regions were further
examined by reverse transcription-polymerase chain reaction (RT-PCR)
using specific primers to make cDNA clones covering the gaps.
Finally, every sequence jumping site was further confirmed by RT-PCR
using primers across the junction sequences. Using these cDNA
clones, the entire sequence of DI-2 except the very 5' end was
obtained (Fig. 2).

Fig. 2. Nucleotide sequence comparison of DI-2 with the
corresponding regions of DIssE and MHV A59 genomic RNA. Sequences of
DIssE, MHV A59 genes 1a, 1b, and gene 7 were derived from published
data (18, 24-26). The open box indicates the AUG initiation codon
and the underline indicates the termination codon of the ORF of
DI-2. Nucleotide insertions are identified with brackets underneath.
Solid triangles indicate the sites where fusion of discontinuous
sequences occurred. Numbers of nucleotides from the 5' end are
according to the DIssE sequence (18). cDNA cloning of DI-2 was
carried out according to the procedure described previously (19)
except for linkers and cloning vector. EcoRI linkers and
EcoRI-digested plasmid pBluescript SK+ (Stratagene) were used for
cDNA cloning. Recombinant DNAs were screened by colony hybridization
(36) using a complete cDNA clone of DIssE as the probe (22). For
RT-PCR, the entire RNA was made into two overlapping PCR products by
using two pairs of primers separately for the 5' half and 3' half of
the DI-2 RNA. cDNA cloning by PCR was carried out as described
previously (19). Sequencing was carried out by Sanger's
dideoxyribonucleotide chain termination method (37). Since one of
the primers used for PCR cloning is complementary to the 5' end of
the leader of MHV-JHM RNA, the sequence of the very 5' end of DI-2
could not be determined.

The sequence analysis of DI-2 RNA showed that this DI RNA is 2924
nucleotides long and composed of four regions derived from
discontiguous regions of the MHV genome (Figs. 2 and 3). The entire
third region and part of the fourth region of DI-2 are not present
in DIssE. The first region (864 nucleotides) and the second region
(445 nucleotides) represent a sequence corresponding to the same
regions of DIssE RNA, and the junction between these two regions is
identical to that of DIssE. However, the second region of DI-2 is
shorter than that of DIssE. The third region was derived from the
sequence corresponding to 20.2 to 20.8 kb from the 5' end of the
genomic RNA. This region corresponds to the sequence containing the
MHV RNA packaging signal as identified in DIssF RNA (23). The fourth
region represents 1023 nucleotides from the 3' end of the MHV
genome.
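As a quick consistency check on the region sizes quoted above, the length of the third region, which the text gives only as genomic coordinates (20.2 to 20.8 kb), can be inferred by subtracting the other three regions from the total length of DI-2. The following is a minimal sketch; the variable and function names are illustrative, and the resulting third-region length is a derived estimate rather than a figure reported in the text.

```python
# Lengths in nucleotides, as stated in the text for DI-2 RNA.
DI2_TOTAL = 2924   # full length of DI-2
REGION_1 = 864     # first region (DIssE-like)
REGION_2 = 445     # second region (DIssE-like)
REGION_4 = 1023    # fourth region (3' end of the MHV genome)


def implied_region_length(total, *known_regions):
    """Return the length left after subtracting the known regions from the total."""
    return total - sum(known_regions)


# Third region is not given as a length in the text, so infer it.
region_3 = implied_region_length(DI2_TOTAL, REGION_1, REGION_2, REGION_4)
print(f"Implied third region: {region_3} nt")
```

The inferred value is roughly 0.6 kb, which agrees with the stated span of 20.2 to 20.8 kb on the genome.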
Fig. 3. Schematic diagram of the structure of DI-2 RNA. The
structure of DI-2 is compared with the structure of the MHV A59
genomic RNA and DIssE RNA. Genes 1 through 7 represent the seven
genes of MHV. Regions which represent DIssE sequence and MHV A59
genomic sequence are indicated by open arrows and solid arrows,
respectively.


To examine the origin of DI-2 RNA, the DI-2 sequence was aligned
with the previously published DIssE (18) and A59 (24-26) sequences
(Fig. 2). Since the third region and the first 436 nucleotides of
the last region of DI-2 are not present in DIssE, they were compared
only to the A59 genomic RNA sequence. The results showed that the
first region of DI-2 is almost identical to the corresponding
sequence of DIssE, with an identical junction site. The
corresponding A59 sequence has 29 nucleotide differences and four
extra nucleotides. However, within the leader sequence, DI-2 RNA
contains three UCUAA repeats, while the reported DIssE sequence
contains four of them (18), and A59 genomic RNA contains two copies
(17, 24). The heterogeneity of UCUAA copy numbers in the leader RNA
has previously been reported for several DI RNAs (27), genomic RNAs
(28), and mRNAs (29). This heterogeneity probably resulted from
leader RNA jumping during DI RNA replication. In addition, one
nucleotide in the first region, at 815 nucleotides from the 5' end,
is different from that of both DIssE and MHV-A59. This could be
explained by mutations during DI RNA replication. The second region
of DI-2 RNA is also identical to that of DIssE RNA and distinct from
the corresponding A59 sequence, which differs by 32 nucleotides in
this region (S. Weiss, personal communication). This region is
smaller than the second region of DIssE RNA. These results suggest
that the first and second regions of DI-2 RNA were directly derived
from DIssE RNA. In contrast, the sequences of the third and fourth
regions are identical to those of the MHV-A59 genome except for one
nucleotide at 54 nucleotides from the 3' end of the genome. The
corresponding region of DIssE contains 28 nucleotide differences and
3 extra nucleotides. Since the third and fourth regions of DI-2 RNA
contain an extra stretch of sequence which is not present in DIssE,
and the sequence of the last region of DI-2 is identical to that of
MHV-A59 genomic RNA and different from DIssE, it is likely that the
third and fourth regions of DI-2 were derived from MHV-A59 genomic
RNA. Thus, DI-2 RNA was clearly derived by recombination between
DIssE and helper A59 virus RNA with a cross-over site in the second
domain of DIssE RNA, followed by additional polymerase jumping
during copying of A59 RNA sequences. Since nucleotide 2870 of DI-2
is identical to that of DIssE, in contrast to A59, we cannot rule
out the possibility that the 3' end of DI-2 was derived from DIssE
as a result of an additional recombination event.
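The reasoning above assigns each region of DI-2 to the parent whose aligned sequence shows the fewest nucleotide differences. The sketch below illustrates that logic only. The short sequences are made-up placeholders (not real MHV, DIssE, or DI-2 sequence), the function names are hypothetical, and the inputs are assumed to be already aligned to equal length, so insertions and deletions are not handled.

```python
def count_mismatches(a, b):
    """Count differing positions between two pre-aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for x, y in zip(a, b) if x != y)


def closest_parent(query, parents):
    """Return (parent_name, mismatches) for the parent most similar to the query."""
    scores = {name: count_mismatches(query, seq) for name, seq in parents.items()}
    best = min(scores, key=scores.get)
    return best, scores[best]


# Toy aligned fragments, invented for illustration only.
toy_regions = {
    "region_1": {"DI-2": "AUGGCUAAGC", "DIssE": "AUGGCUAAGC", "A59": "AUGACUAGGC"},
    "region_4": {"DI-2": "CCGUAGAUUA", "DIssE": "CCAUAGGUUA", "A59": "CCGUAGAUUA"},
}

assignments = {}
for region, seqs in toy_regions.items():
    parents = {"DIssE": seqs["DIssE"], "A59": seqs["A59"]}
    assignments[region] = closest_parent(seqs["DI-2"], parents)
    print(region, assignments[region])
```

A region-by-region switch in the closest parent, as in this toy case, is the pattern that points to recombination rather than de novo generation from a single template.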

This result indicates that RNA recombination is at least one of the
mechanisms for the natural evolution of DI RNAs. The possible
occurrence of recombination between DI RNA and helper viral RNA has
also been demonstrated in a recent study, in which marker mutations
in DI RNA were incorporated into the viral genome by RNA
recombination during virus replication (30). The DI-2 RNA is unique
in that multiple recombination events between nonhomologous
sequences of DI RNA and helper viral RNA (aberrant homologous
recombination (38)) were involved. Thus, both nonhomologous and
homologous recombination can occur between the DI and helper viral
RNAs. At the present time, we do not know whether DI-2 RNA was
directly derived from recombination between DIssE RNA and helper
viral RNA by polymerase jumping during RNA synthesis, or indirectly
from recombination between DIssE RNA and an unidentified A59-derived
DI RNA species. If it is the former, several polymerase jumping
events must have occurred during the synthesis of DI-2 RNA.

It is interesting to note that the third region of DI-2 RNA is not
present in DIssE. However, DIssF RNA, which is packaged into virus
particles efficiently, also contains 276 nucleotides corresponding
to the 5' end of the third region of DI-2 (19). This region contains
the putative packaging signal for MHV RNA (23). Since DIssF and DI-2
were independently derived using two different helper viruses, the
presence of similar stretches of sequence in both RNAs suggests that
this region is a recombinational hot spot or that the presence of
this sequence confers a selective advantage. Similar sequences have
also been found in another DI RNA derived from MHV-A59 (20).

Like DIssE, DI-2 has a single large open reading frame (ORF), which
is 2415 bp long. The initiation and termination sites of the ORF are
identical to those of the ORF in DIssE, although the ORF of DI-2
contains different regions from those of DIssE. The same ORF is
contained in both DI-2 and DIssE in the first and second regions. As
reported previously, the first 215 amino acids correspond to the N
terminus of the MHV gene 1 product, and part of the protein is
cleaved into a p28 protein (31-33). The significance of this ORF in
DI RNA replication is not yet clear.